A Robust Feature Extraction for End Point Detection in the Nonstationary Noisy Environment
نویسندگان
چکیده
This paper proposes a robust feature extraction for voice activity detection (VAD) in noisy environments where nonstationary noises exist. The accuracy of the VAD is drastically reduced because the fluctuation of features at the noise intervals causes false alarm rate to being increased. In this paper, in order to improve the VAD accuracy, harmonic-weighted energy is proposed. This feature extraction method focuses on voiced speech interval and weights the amount of the ‘harmonicity’ to the averaged energy of the frame input. To evaluate the performance of the proposed feature extraction method, receiver operating characteristic curves and equal error rate are measured. From the results, it is proved that the proposed method is the discriminative feature for VAD. Keywords— voice activity detection, robust feature extraction, harmonic to noise ratio
منابع مشابه
Speech Recognition in Noisy Environment Using Different Feature Extraction Techniques
In this paper, different feature extraction methods for speech recognition system such as Melfrequency cepstral coefficients (MFCC), linear predictive coefficient cepstrum (LPCC) and Bark frequency cepstral coefficients (BFCC) are implemented and the comparison is done based on average recognition accuracy. We suggest a noise robust isolated word speech recognition system which can be applied i...
متن کاملAccurate Fault Classification of Transmission Line Using Wavelet Transform and Probabilistic Neural Network
Fault classification in distance protection of transmission lines, with considering the wide variation in the fault operating conditions, has been very challenging task. This paper presents a probabilistic neural network (PNN) and new feature selection technique for fault classification in transmission lines. Initially, wavelet transform is used for feature extraction from half cycle of post-fa...
متن کاملروشی جدید در بازشناسی مقاوم گفتار مبتنی بر دادگان مفقود با استفاده از شبکه عصبی دوسویه
Performance of speech recognition systems is greatly reduced when speech corrupted by noise. One common method for robust speech recognition systems is missing feature methods. In this way, the components in time - frequency representation of signal (Spectrogram) that present low signal to noise ratio (SNR), are tagged as missing and deleted then replaced by remained components and statistical ...
متن کاملAcoustic and Data-driven Features for Robust Speech Activity Detection
In this paper we evaluate different features for speech activity detection (SAD). Several signal processing techniques are used to derive acoustic features that capture attributes of speech useful in differentiating speech segments in noise. The acoustic features include short-term spectral features, long-term modulation features both derived using Frequency Domain Linear Prediction (FDLP), and...
متن کاملEvaluation and optimization of noise robust front-end technologies for the automatic recognition of Hungarian telephone speech
In this paper a variety of front-end configurations are evaluated on Hungarian telephone speech databases. Our aim was to measure directly the efficiency of the front-ends on real noisy and normal speech data. As a baseline the ETSI ADSR standard front-end is used. Some simplification on the standard is introduced resulting in better performance on our databases than the original front-end in t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014